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Abstract. We consider commutative regular and context-free grammars, or, in other 
words, Parikh images of regular and context-free languages. By using linear algebra and a 
branching analog of the classic Euler theorem, we show that, under an assumption that the 
terminal alphabet is fixed, the membership problem for regular grammars (given v in binary 
and a regular commutative grammar G, does G generate v?) is P, and that the equivalence 
problem for context free grammars (do Gi and G 2 generate the same language?) is in 112 . 


1. Introduction 

Let S be a finite alphabet. By S* we denote the set of words over S, or finite sequences of 
elements of S. For a word u) G S*, by ^{w) (the Parikh image of w) we denote the function 
from S to non-negative integers N, such that each x € T, appears ^{w){x) times in w. For a 
language L C S*, 'I'(L) = {'L(u;) : w £ L} C 

Context free and regular languages are one of the most important classes of languages in 
computer science |HU 79 j . By a famous result of Parikh |Par 66 j , a subset of is a Parikh 
image of a context free language if and only if it is a semilinear set, or a union of finitely 
many linear sets. 

In this paper, we explore the complexity of various problems related to Parikh images 
of context free languages, such as the following: 

• Membership: Given a context-free grammar G and v £ (given in binary). Is u a 
member of the 'I'(G), the Parikh image of the language generated by G? 

• Universality: Given two context-free grammars G, is ^'(G) equal to N^? 

• Inclusion: Given two context-free grammars Gi and G2, does 'I'(Gi) C \I'(G2)? 

• Equality: Given two context-free grammars Gi and G2, does ^'(Gi) = 'I'(G2)? 

• Disjointness: Given two context-free grammars Gi and G2, is 'I'(Gi) n 'I'(G2) nonempty? 

Since in this paper we are never interested in the order of terminals or non-terminals, 
we treat everything in a commutative way. This allows us to identify the commutative 
languages (subsets of S*) with their Parikh images (subsets of N^). 

2012 ACM CCS: [Theory of computation]: Models of computation—Abstract machines; Formal 
languages and automata theory—Grammars and context-free languages. 
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In the non-commutative case, the size of alphabet usually does not matter very much: 
larger alphabets can be encoded as words over smaller alphabets, for example the alphabet 
{a, b, c} can be encoded as {b, ba, baa} or {a, ba, bb}. This changes in the commutative case: 
each new letter in the alphabet literally brings a new dimension to the Parikh image. If 
we do not fix the size of the alphabet, it can be easily shown that even the membership 
problem for regular languages is NP complete. 

There are many practical uses of regular and context-free languages which do not care 
about the order of the letters in the word. For example, when considering regular languages 
of trees, we might be not interested in the ordering of children of a given node. |BM99] and 
|NS99) consider XML schemas allowing marking some nodes as unordered. 

Some complexity results regarding semilinear sets and commutative grammars have 
been obtained by D. Huynh |Huy80 , Huy85| , who has shown that equivalence is H^-hard 
both for semilinear sets and commutative grammars (where H^ is the dual of the second 
level of the polynomial-time hierarchy, |Sto76j L 

Some research has also been done in the field of communication-free Petri nets, or Basic 
Parallel Processes (BPP). We say that a Petri net f |Pet81j . |Rei85] i is communication-free if 
each transition has only one input. This restriction means that such a Petri net is essentially 
equivalent to a commutative context-free grammar. |Yen97j shows that the reachability 

where d is a constant 


equivalence problem for BPP-nets can be solved in DTIME (2 


and s is the size of the problem instance. For general Petri nets, reachability (membership 
in terms of grammars) is decidable |Kos82l ILerlOj , although the known algorithms require 
non-primitive recursive space; and reachability equivalence is undecidable |Hac 75]. Also, 
some harder types of equivalence problems are undecidable for BPP nets |Hiit94j . See [EN94] 
for a survey of decidability results regarding Petri nets. 

As mentioned above, we will assume in this paper that the alphabet is fixed. We have 
the following two main results: 

• The membership problem for commutative regular languages over a fixed alphabet is 
in P. This was open for a long time (even for a binary alphabet), until it was solved 
independently by the author of this paper and Anthony Widjaja Lin. This result, and its 
applications, was presented as a merged paper at the LICS conference [KTlOj . 

• The equivalence problem for commutative context-free languages over a fixed alphabet is 
in n 2 . As far as we know, there have been no successful previous attempts in this direction, 
except for the much simpler case where the alphabet has only one symbol |H^ . 


In context free grammars, usually the order of transitions used in a derivation is important: 
we are never allowed to use a non-terminal which has not yet been produced. For example, 
a transition X —)■ aX allows us to produce an arbitrary amount of the terminal symbol a, 
but only if we have access to the non-terminal symbol X. However, derivations also can 
be defined commutatively. The well known Euler’s theorem gives a necessary and sufficient 
condition for whether there is a path or cycle in a graph which uses each edge e exactly Ue 
times: the condition says that each vertex has to be entered and left exactly the same number 
of times {Euler condition) and reachable from the starting vertex {connectedness). Thus, we 
can forget the order of edges on such a path or cycle, count them, and just check that the 
conditions are satisfied; a roughly similar approach is used in the definition of a cycle in the 
construction of homology groups in algebraic topology jHatn2j . The same approach works 
in our more general case — we can define a commutative run and a commutative cycle by 
counting the number of times each transition has been used, and just like in Euler’s theorem. 
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a very simple condition can be used to check whether such a function from transitions to 
integers is indeed a Parikh image of a valid derivation. Similar technique has been used 
previously in |Esp97| ; it also has been used successfully to solve an open problem in database 


theory [Kopll 


Although we are not allowed to use a non-terminal which has not yet been produced, 
our framework straghtforwardly allows the following interesting generalization: we allow 
our terminal symbols to be produced in negative quantities. In this case, these negative 
productions do not have to be balanced by positive productions. Thus, our commutative 
languages are interpreted as subsets of rather than . Although such languages do not 
commonly appear in the theory of languages, they turn out to be interesting and useful. For 
example, such negative production allow to reduce disjointness of A and B to membership 
very easily - just check for membership ofOinA — i? = {a — h : a £ A,b £ B} (see 
Proposition 5.12 below) . This generalization was also discovered independently by Anthony 
Widjaja Lin. 


1.1. Related papers. This paper is the full version of [KTlOj . with the following major 
differences: 

• [KTIO] is a result of merging of two submissions. This paper includes only results obtained 
by the author. 


Equivalence of commutative context-free grammars (Theorem 3.2) has been generalized 


to the case where we allow terminal symbols to be produced in negative quantities. 
Universality of commutative context-free languages has been only proven to be in II^ in 
[KTlOj . Here, we prove that it is in fact H^-complete. 

Full proofs are included. 


1.2. Structure of the paper. The paper is structured as follows. 

Section introduces basic definitions and facts from linear algebra and language theory, 
and then presents bounds on the size of runs and cycles of commutative grammars. The 
techniques used to obtain these bounds are almost the same for regular and non-regular 
grammars, but the results are much stronger in the regular case. This section culminates 
in Theorem |2. 13 which gives a compact representation of a commutative regular language 
(equivalently, a Parikh image of a regular language). This compact representation will be 
used in Section to solve the membership problem of commutative regular languages in P. 

The whole Section presents a “window theorem” (Theorem 3.2) for (non-regular) 
commutative grammars. This theorem roughly says that, in order to decide whether two 
grammars are equivalent, we only have to look at a window of exponential size, and it will 
be used in Section p to show that the equivalence of commutative grammars is in H^. In 
fact, most of Section ^is the proof of Lemma 3.1 about semilinear sets, which does not refer 
commutative grammars directly; Theorem 3.2 is a simple corollary. Lemma 3.1, as well as 


some other lemmas in this section, thus could potentially have applications to semilinear 
sets in general, whether they come from commutative grammars or not. 

Section presents a non-regular grammar over a three letter alphabet which has no 
compact representation similar to one given by Theorem 2.13 We believe this example is of 
interest, because it shows why we cannot prove Theorem 3.2 using a simpler approach. 
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Section gives the answers to questions about the complexity of the problems mentioned 
in the introduction; some results are obtained directly, and for some we need our compact 
representation and window theorems from Sections and 
There is a conclusion in Section [6l 

A reader interested only in the first main result (that the membership problem of 
commutative regular languages in P) only has to read Section 2 (assuming that the grammar 
is regular) and the respective part of Section 5. Due to the lack of branching, results in 


subsections |2.3| and 2.4 can be obtained in a much more straightforward way for regular 
grammars. A reader interested in the second main result (that the equivalence problem for 
commutative context-free languages over a fixed alphabet is in II^) has to read Section 2 
(without concentrating on regular grammars), Section 3, and the respective part of Section 
5; Section 4 should also be of interest for such readers. 


2. Preliminaries 

2.1. Vectors and matrices, and semilinear sets. In this subsection we present the basic 
notation, notions and facts from linear algebra, which will be used throughout the paper. 

As usual, we denote the set of integers by Z, the set of non-negative integers by N, the 
set of rational numbers with Q, and the set of real numbers with M. We also denote the 
set of non-negative real numbers with P (we do not use the more standard notation to 
avoid double indexing). 

We also use the notation [a, b] for the interval of all real numbers between a and b, and 
[a..6] for the interval of all integers between a and b. 

Let X be a finite set. We interpret as multisets over X: z £ represents a 
multiset where each x £ X appears exactly Zx times. For a x G X, by [x] £ N'’*' we represent 
the multiset which represents x: [x](y) = 1 iff y = x, 0 otherwise. The set is a subset of 
1/^ (intuitively, we allow the elements of X to appear in negative quantities), and 

For V £ M^, let |u| = Ylx&x and ||u|| = maXjjgx 

We also interpret elements of as (column) vectors. By we denote the set 

of matrices over M with columns indexed by X, and rows indexed by T; C 

denotes matrices where all entries are in A C M. For M £ we use the notation 

My for the coefficient in row y £ Y and column x £ X. For M G N £ and 

V £ we define the multiplication in the usual way: 

Mv£R^, {Mv)y = ^ M^Vx 
x&X 

NM £ (XM)^ = N^Uy 

y& 

For a matrix M G and a set S C by MS C we denote {Ms : s £ 5}. 

Similarly, for sets 5i, S 2 C we define 5i-|-S'2 = {si -|- S 2 : si G 5i, S 2 £ *S' 2 } and 5i — ^2 = 
{si — S 2 : si G S*!, S 2 G 52 } (the algebraic sum and difference). 

For a finite set S C and A C M, we define 
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_ I ^ ; Vs G ^ 

Ues 

Typically, A will be one of Z (5®^ is the additive group generated by S'), N (S®^ is the 
additive monoid), M (5'©® the linear space spanned by S), P ig the cone spanned by 
S), or [0..H] (we limit the number of uses of each element of S). 

A set V C Z^ is linear iff it can be written as 1 ^ = no + P®^, where vq G Z^ and the 
set P C Z^ is finite. The vector vq is called the base of V, and the elements of P are called 
periods. Moreover, we say that a linear set y is a simple linear set iff the elements of P 
are linearly independent. 

A set y C Z^ is semilinear iff it is a union of hnitely many linear sets. 

A semilinear set V C Z^ is called a simple bundle iff it can be written a.sV = W +P®^, 
where W C Z^ is finite and the elements of P are linearly independent. We say that a 
bundle is bounded by (P, Y) iff ||u|| < B for each v G W, and ||p|| <Y for each p € P. 

The following lemmas from the linear algebra will be important for us. We denote the 
determinant of M G by detM. 

Lemma 2 . 1 . Let M G [—C^C]^^^ be a matrix. Then the determinant of M is bounded 
by H\x\ {C) = Moreover, if M G , the determinant of M is an integer. 

Proof. This follows straightforwardly from the Leibniz formula for determinant. In fact, 
Hadamard |Had93| has shown a better bound: H\x\ (C) = . □ 

Let us recall the well known Cramer’s rule |Cra5n| : 

Lemma 2 . 2 . Let A G be a non-degenerate matrix, and b G M^. Then the system of 
equations Av = b has a unique solution given by Vx = det[Ax)/ det(A), where Ax is obtained 
by replacing the x-th column of A with the vector b. If A is not a non-degenerate matrix, 
the system of equations either has no solution, or infinitely many solutions. 



The following corollary is straightforward: 

Lemma 2.3. Let Ai be a fixed set of indices, and A G N. There is a number P, polynomial 
in A, such that whenever the system of equations Mx = u, where M G [— A..A]^^^, 
V G [—A..A]'’^ has a unique solution x G we have x = x^jC, where x' G [—P..P]'’^, and 
C G [-P..P]. 


Proof. We apply the Cramer’s rule (Lemma 2.2) and the Hadamard bound (Lemma 2.1 ).0 
Lemma 2.4. Let M G be a non-degenerate matrix. Then 


(det M)Z^ C MZ 


7X 


Proof. Again, this is immediate from Cramer’s rule (Lemma 2.2). The intuition is as follows. 
For u,v € Z^, we say that u = viSu — v£ MZ^. The quotient group Z^/= has detM 
elements (intuitively, for |X| = 2, the number of elements is equal to the area of the 
parallelogram given by columns of M; this intuition also works in other dimensions). Thus, 
(det M)v = 0. □ 
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Lemma 2.5. Let P C [—C..C]^ be a linearly dependent set of vectors. Then for some 
a G we have Ylv&p = 0, where ||a|| < H\x\ {C)-, and > 0 for some n G P. 


Proof. Without loss of generality we can assume that P is a minimal linearly dependent set. 
Let [Pv] G be the set of coefficients for which Yhv&v ~ such that 7 ^ 0 for 

some V € P. Since P is minimal, we know that (^y) is unique up to a constant: that is, if 
{f3y) and (/3(,) have this property, then /?(, = qj3y for some g G M. 

Let u G P be the element such that \/3u\ > \Pv\ for each v £ P. Let Pq = P — {u}. We 
know that the P®'* = P®®. 

Since P was minimal, we know that Pq is linearly independent set. Let Pi P Pq be such 
that Pi C [—C..C]^, and Pi is a base of M^. This can be done by extending Pq by unit 
vectors which are not yet in the subspace spanned by Pq. 

Let M G be a matrix whose columns are the elements of Pi. Since Pi is 

a base, each vector v can be written as a linear combination of elements of Pi with real 
coefficients in a unique way. From Lemma 2.4 we know that for v = (det M)u the coefficients 
are integers; moreover, since (detM)M G P®“ = P®*, the coefficients are 0 for elements of 
Pi\Po. Thus, we get Yhv^p^vV = 0, where au = detM and ay are integers. Since {f5y) was 
unique up to a constant, and u was the vector with the largest coefficient, we know that the 
same holds for {ay). From Lemma 2.1 we know that ay < det M < P|x| (C) for all u G P. □ 


Lemma 2.6. Let P C [—C..C]^ be a finite set of vectors, and w G P®^. Let H = H\x\ {C). 
Then there is a linearly independent set Pq C P such that w G P®[0-'^] -)- P®^. In other 
words, we can assume that all the periods except ones from Pq are multiplied by factors 
bounded by H. 


Proof. Assume that w = Ylv^p'^vV- Let H = H^x\ {C). 

Let P* = {u G P : n^, > P}. Induction over the cardinality of P*. 

If P* is linearly indepen dent , we are done (we can simply set Pq = P* 
Otherwise, from Lemma 


2.5 


we know that there is a set of coefficients (a^,) G Z'^'’ such 
that YlvePo = 0, and ay < H for each u G Pq, and at least one is positive. For each 
V G Pq, we subtract ay from n„; this does not effect the equation w = repeat 

this until we get riy — Oy < 0 for some v £ Pq (this will have to eventually happen, since at 
least one ay is positive). Since Uy — ay < 0 and ay < H, we get that riy < H. Thus, the 
new set P* is a subset of the old one, and we can apply the induction hypothesis. □ 


2.2. Commutative grammars, runs, and cycles. A commutative grammar (see for 
example |EN94j l is a tuple G = {Ti,Q,qi,5), where S is a hnite alphabet of terminal 
symbols, Q is a set of non-terminal symbols, qj £ Q is the initial non-terminal 
symbol, and 5 P Q X Z^ X is the transition relation. We will write the transition 

{q, a, t) G 5 as g A t, or using a multiplicative notation: q qiq^ denotes the transition 
(g, [x] — [y], [gi] + 2[g2]). The intuition here is that our commutative grammar allows us to 
use a non-terminal symbol g to produce the multiset of terminal and non-terminal symbols, 
given by a and t; we allow our terminal symbols to be produced in negative quantities. 
For a transition d = {q,a,t) £ 5, we denote g, a and t by source((i), 'l'(d) and target((i), 
respectively. 
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We say that a grammar is positive iff ^{d) >0 for each d € 5. Note that our definition 
is more general than the usual one (e.g., |EN94j h since we allow non-positive commutative 
grammars. 

Runs, paths, and cycles are usually defined as sequences or trees of transitions. However, 
we will define them as multisets of transitions. The next subsection will show how our 
definitions are related to the usual ones. 

Let i? G be a multiset of transitions. Let source(i?) = Rrf [source((i)], 'I'(R) = 
J2de5 'h(d), target(R) = YldeS ^d target((i). Intuitively, the number source(i2) counts 
times each non-terminal symbol has been used, while target(i?) counts the number of times 
each non-terminal symbol has been produced, and 'L {R) counts the number of times each 
terminal symbol has been produced. For p, q £ Q we say that p q iff for some d G 5 such 
that Rd > 0, p = source(d) and taiget{d){q) > 0; i.e., a transition which produces q from p 
appears in R with a non-zero quantity. By —we denote the transitive reflexive closure of 
-^R. For a multiset s £ and q £ Q, we say that s — q iS p —q for some p £ s. By 
supp(ii) C Q we denote the set of non-terminals q £ Q such that (source(ii))q > 0. 

We say that a multiset of transitions R G is a (commutative) subrun from s £ 
to t G if the following two conditions are satisfied: 

• Euler condition: source(R) — s = target(ii) — t; 

• Connectivity: whenever q £ source(R), we have s^\q. 

Intuitively, in a subrun, for each state q, we start with Sq copies of state q, during the 
run we use up sourceg(R) copies and produce targetg(i?) more copies, and tg copies remain 
at the end. The Euler condition says that these numbers agree. In the case where s = t it 
means that each state is produced exactly as many times as it is produced, just like in the 
classic Euler condition which says that each vertex has the same in-degree and out-degree. 
In the next subsection we will show that, as long as our conditions on a commutative subrun 
are satisfied, there is an ordering such that we never use up something which has not yet 
been produced. 

The following kinds of subruns will be of most interest for us: 

• A run from p is a subrun from [p] to 0. 

• A path from pi to p 2 is a subrun from [pi] to [p 2 ]. 

• A cycle from p is a path from p to p. 

By 'I'(G) we denote the (commutative) language of G, or the set of T(R) for all 
commutative runs R from qj. 

We say that G is in normal form iff for each transition (q, a, t) in 6 we have |t| < 2 and 
|a| < 1. Moreover, we say that G is regular iff for each transition (q, a, t) in 6 we have |t| < 1 
and |a| <1. If G is not in normal form, it is straightforward to construct a grammar G' in 
normal form such that 'I'(G) = T(G'). This is done by replacing each transition which does 
not satisfy the restriction by several simpler transitions, adding additional non-terminals. 
For example, q qiq 2 q 3 is replaced with q qiq' and q' ^ q 2 q 3 , where q' is a new 
non-terminal. 

Usually, we will assume that our grammars are in normal form; in this case, the size of 
the grammar can be described by stating the number of non-terminals N and the size of the 
alphabet A, since for a grammar in normal form the number of transitions is bounded by 
0{N^A). In the sequel, many results will work both for regular grammars and the general 
case, but be much stronger in the regular case - as a typical example, for regular grammars 
we can achieve a polynomial bound, but for non-regular grammars in normal form we can 
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only achieve an exponential one. Both the polynomial bound in the regular case and the 
exponential bound in the general case will be of interest for us. The notation introduced 
in Lemma 2.11 below, which is linear in N for a regular G and exponential for a non-regular 
G, should make it clear that a result is given both for regular and general grammars, but it 
is stronger for regular ones. 

We say that a cycle G is simple iff it is not a sum of two smaller non-zero cycles. We 
say that a run i? is a skeleton run iff it cannot be written as i? = + C*, where C is a 

cycle, and supp(i?) = supp(i?o). 


2.3. Commutative subruns can be ordered. In this subsection we show that our com¬ 
mutative subruns can be ordered correctly, that is, in each non-empty commutative subrun 
from s we can choose the hrst transition 5 such that the remaining part is a subrun from 
s — source((5) -|- target((5); in other words, we can order the transitions in a subrun in such a 
way that, if we start from s, and each transition consumes source(<5) and produces target((5), 
we can arrange the transitions in a way that we never consume something which has not 
been produced yet. This result is practically equivalent to Theorem 3.1 in |Esp97| , where it 
has been stated in the setting of communication-free Petri nets. Also note that, for regular 
grammars and |s| = 1, this can be seen as a restatement of the well known Euler’s theorem. 

Theorem 2.7. If ii is a non-empty subrun from s to t, then there is a 5 € R such that 
source((5) G s, and R' := R — S is a subrun from s' := s — source(5) -|- target(5) to t. 

Proof. There are two cases: 

• the relation —has a cycle starting in s, i.e., there is a p G s such that p — q and 

q — p. In this case, there is a 6 G R such that source((^) = p and target(5) = q. We 
have to show that R' is a subrun from s' to t. Let r G source (i?'). Since R was a subrun 
from s, we know that and we have to show that s' r. Take a minimal path 

TT witnessing s —r. Since vr is minimal, only its first transition starts from an element 
of s, and in particular, can be equal to 6. 

— If TT did not start in p, it is still a path for — {6 was not in vr). 

— If TT started with <5, we just remove 6 from vr, thus obtaining a path witnessing s' —r. 

— If vr started in p but not with 5, we know that q — p —r, and both subpaths do not 
contain 6 (provided that we take a minimal path q — p) and thus they are still paths 
for -fRi. 

• otherwise, take any 5 such that source((j) = p G s. Such a 5 must exist from the 

connectedness condition. Again, we have to show that R' is a subrun from s' to t. Let 
r G source(i?'). Since R was a subrun from s, we know that and we have to show 

that s' -^*Ri r. Take a minimal path witnessing s r. Since vr is minimal, only its first 
transition starts from an element of s, and in particular, can be equal to 5. There are four 
subcases: 

— If vr did not start in p, it is still a path for -g-r/ {5 was not in vr). 

— If vr started with <5, we just remove 6 from vr, thus obtaining a path witnessing s' r. 

— If vr started in p but not with 6, and Sp > 2, then s'{p) > I, so p G s' and the path is 
still valid for R'. 

— If vr started in p with S' ^ S, and Sp = I, we know that sourcep(i?) — Sp = targetp(i?) —tp. 
Since source((5) = source((j') = p and Sp = I, targetp(ii) > 0, and thus there must be a 
transition 5" such that p G target((5"). We know that s —source((5"), and the path 
cannot include S - otherwise, we get a sequence of paths s —p -g-r target(5) — 
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source((5") p, and thus we get a cycle, which was dealt with in the previous case. 
Thus, we have s —source(5") —p — r. □ 


2.4. Derivation trees. In this subsection we compare our commutative runs and cycles 
with the usual derivation trees. We obtain a subrun from a derivation tree simply by counting 
how many times each transition has been used; using the result of the previous section, we 
also show that this process can be reversed, i.e., a derivation tree exists for each commutative 
subrun. We then use the derivation trees to show upper bounds on the size of simple cycles 
and skeleton runs. 

Again, this is much easier for regular grammars — in this case, derivation trees are 
simply paths, i.e., sequences of transitions do, di, d 2 , ■ ■ ■ ,dd such that [source (Sj+i)] equals 
target (Si). 

Let G be a commutative grammar in normal form. A derivation tree from p £ Q is a 
tuple T = {V,vo, P, d), such that: 

• y is an arbitrary set of vertices, 

• uo £ is a special vertex, called the root of V, 

• P is a function from V — {uq} to V (parent), such that for each v G V there is a n G N 
(called the depth of v) such that P"(u) = vq, 

• d is a function from V to S. We will use source(u) and target(u) for source(5(u)) and 
target(cl(u)), respectively. 

• source(cl(uo)) = p, 

• for each v G V, we have P(v} > 0, where 

F(v) = target(u) — [source(rc)]. 

w:P(w)=v 

We also denote F(T) = source(T) = target(T) = 

Intuitively, each vertex v of the derivation tree represents that we are using a non¬ 
terminal and produce new terminals and non-terminals, according to the transition d(v). 
Children of v can use the non-terminals produced. F(v} represents the “free” non-terminals 
which have been produced, but have not been used by children; F(v} > 0 represents the fact 
that children cannot use non-terminals which have not been produced. 

Let U(T) : d —7- N be the function counting the number of times each transition has 
been used in the derivation tree T: (U(T))d = \{v G V : d{v) = d} [• 

We say that a derivation tree T from p is full iff F{T) = 0, and a path to p 2 iff 
F{T) = [p 2 ]- We say that a derivation tree T is cyclic from p iff it is a path derivation tree 
from p to p. 

Lemma 2.8. If T is a derivation tree from p, then U(T) is a commutative subrun from [p] 
to F{T). 
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Proof. 


^iT) = ^ ( target(i;) - 


vev 


[source(i(;)] 
[source(i;)] 


E 

w:P{w)=v 

= target(i;) - ^ 

vGV wGV—{ i^o} 

= target(T) — source(r) + [source(i;o)] 

= target([/(T)) — source(J7(r)) + [p\. 

We also have that P{v) —)'u(r) for each v £ V, thus U{T) is indeed connected from 
P{vo)=p. □ 

Theorem 2.9. If p G Q, and i? G is a commutative subrun from [p] to t G then 
R = U(T) for some derivation tree T from p such that F{T) = t. 

Proof. We will construct the tree T inductively. We start with Tq = {V, vq, P, d), where d{vo) 


is 5 from Theorem 2.7 for R, and V = uq- We will keep the following invariant: U{T) < R, 


and R — U{T) is a subrun from F{T) to t. From Theorem 2.7 the invariant is satisfied for Tq. 
Suppose that the invariant is also satisfied for T, and U{T) < R. . From Theorem 2.7 again 


there exists a 6' £ R — U{T) such that source((I') G F{T). Since source(5') G F{T), there 
exists a v such that source((5') G F{v). We obtain a new tree T' by adding a new vertex w 
such that P{w) = v and d{w) = 5'. From Theorem 2.7, the invariant is also satisfied for T'. 


Finally, we get a tree T such that U{T) = R. From Lemma 2.8 we know that F{T) = t. D 


Corollary 2.10. The following two conditions are equivalent: 

(1) 7? G N*^ is a commutative run from p, 

(2) R = U(T) for some full derivation tree T from p. 

Also, the following two conditions are equivalent: 

(1) C G is a path from pi to p 2 , 

(2) C = U(T) for some path derivation tree T from pi to p 2 . 
In particular, the following two conditions are equivalent: 

(1) C G is a cycle from p, 

(2) C = U(T) for some cyclic derivation tree T from p. 


Proof. Straightforward from Lemma 2.8 and Theorem 2.9 


□ 


Lemma 2.11. Let G be a grammar in normal form. Let H be a derivation tree such that 
D has no vertices at depth M. Then \D\ < Tq , where F^ = M + 1 if G is regular, and 
2M+1 otherwise. 

Proof. If each vertex has at most G children, then there are at most G'^ vertices at depth d. 
For regular grammars G = 1, and for grammars in normal form, G = 2. All the hypotheses 
follow from simple calculations. □ 

Theorem 2.12. Let G be a grammar in normal form with N non-terminals. If G is a 
simple cycle, then |G| < F^. If 7? is a skeleton run, then |7?| < F^ . 


Proof. First, let G be a simple cycle. From Corollary 2.10 we know that G = U(r) for some 
cyclic derivation tree T = {V, vq, P, d) from p. 
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We will show that T has no vertex at depth N. Indeed, suppose otherwise that T has a 
vertex v at depth n > N. Consider a branch of the derivation tree: v,p{v),p'^(v),... ,p'^{v) = 
vq. Since n > N, we have source(pj(i;)) = souvce{pj{v)) for some i < j. Let Vi be all 
the descendants of Pi{v) (inclusive), V 2 be all the other descendants of Pj{v), and V 3 be 
all the other vertices. Then Ti = {V 2 ,Pi{v), P, d) is a cyclic derivation tree, and so is 
T 2 = (hi U V 3 ,pi{v), P*,d), where P*{pi{v)) = P{pj{v)) and P*{w) = P{w) for all other 
re G hi U V 3 . Hence, C is not a simple cycle (C = U(T) = U(Ti) + U(T 2 )). 

Now, let i? be a skeleton run. From Corollary 2.10 we know that R = U(T) for some 
full derivation tree T = {V,vo, P, d) from qq. 

We will show that T has no vertex at depth Indeed, suppose otherwise that T has a 
vertex v at depth n > iV^. Consider a branch of the derivation tree: v,p{v),p^{v),... ,p^{v) = 
Vq. Since n > N'^, there are indices • • •, fv such that somce{pi^{v)) is the same non¬ 
terminal q for each k. By repeating the construction above N times, we can decompose 
R = Rq + Ylk=i where Rq is a run and Ck is the cycle between Pii^_-^{v) and pi^,{v). 
Let Uk = supp(i?o + Ylk=i^k- Since C/q C C/i C 1/2 ... C Un C Q, Uq is not empty, 
and IQ I = N, there must be k such that Uk = Uk-i- Since Ck does not add any new 
non-terminals to the support, we have that supp(i2 — Ck) = supp(ii). This contradicts the 
assumption that R was a skeleton run {R — is a run). 

All the hypotheses follow from the Lemma |2. 11 O 


2.5. Compact Representation of a Commntative Regular Language. In this sub¬ 
section, we will show how to obtain a compact representation of a regular commutative 
language over a fixed alphabet: such a language is a union of polynomially many polynomially 
bounded simple bundles. This compact representation will be used in Section below to 
prove that the membership problem is in P for regular commutative grammars over a fixed 
terminal language. We also obtain a less compact representation in the non-regular case. 

Theorem 2.13. Let G be a grammar in normal form with N non-terminals over an 
alphabet of size A, and let i? be a run. Let Bq = Fi^^ -|- (2TP,)^~^^Ha (^r) ■ Then 
^(R) = T(Ri) + ^ ^{Ck)nk, where: 

• Ri is a run such that |Ri| < Bq, 

• each Ck is a simple cycle from some q G supp(Ri), 

• 'l'(C'fc) are linearly independent, 

• Ufc G N. 


Proof. Let R be a run. As long as R is not a skeleton run, we can decompose R as a sum of 
a smaller run, and a simple cycle. Thus, we obtain that 'I'(R) = T(Ro) -|- YlkeK 

Suppose that T(Cfc) = ^'(G;) for some k ^ 1. Then we remove I from K, and add ni to 
Hk- The equation TllR) = 'I'(Ro) -|- Xlfeeic stiH holds. 

Let K* = {A: G R : nfc > Ha From Lemma 

linearly independent for k G K*. 

Since for each k we have |Gfc| < F^, and '^{Ck) is different for each k, we have 
at most ( 2 Fg)"^ cycles there. Let Rq = Ri -|- J2keK-K* ^k^k- We have |Ro| < |Ri| -|- 
{2T^)^+^Ha (F^), and '^(R) = T(Ro) + EkeK* '^{Ck)nk. □ 


2.6 


we can assume that T(Gfc) are 
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Corollary 2.14. If G is a regular grammar with N non-terminals over an alphabet of size 
A, then 'I'(G) is a union of at most simple bundles bounded by {Bq, N). 


Also note that Bq is polynomial for regular grammars over a fixed alphabet. 


Proof. From Theorem 2.13 we know that for each run R we have T(i?) = T(i?i)-|-^ 'h(C'fc)nfc, 
where is bounded by Bq and is a simple cycle. We bundle the runs which use the 

same cycles together. Since simple cycles are bounded by N, there are at most of them, 
and there are at most sets of linearly independent simple cycles. D 


It is possible to get a better bound on the number of simple bundles in dimension 2, 
even for non-regular grammars. 


Corollary 2.15. If G is a grammar in normal form with N non-terminals over a two-letter 
alphabet, then 'I'(G) is a union of 0{N‘^) simple bundles bounded by {Bg,Pc). 

Proof. Let S = { 01 , 02 }. First assume that the grammar is positive. 

For each non-terminal q, among all the cycles from q, let Ci{q) be the one with the 
greatest proportional amount of ai, for i = 1,2. All the other cycles from q fall in the angle 
between Gi(g) and C 2 {q). 

Now, for each run R, let Ci{R) be the one with the greatest proportional amount of o*, 
among all cycles from q G supp(i?). 

Proceed as in the proof of Corollary |2.14[ , except now we can assume that the cycles 
Gi, C2 are always Ci{R) and C2{R) (instead of arbitrary cycles), as all other cycles can be 
written as positive linear combinations of these two. Since Ci(R) and C2{R) are chosen 
from Ci{q), there at at most N'^ simple bundles. 

For non-positive grammars, for each non-terminal q, one of the two cases holds: 

• all cycles from q (and their positive combinations) fall in the angle between the two 
extreme cycles Gi(g') and C 2 {q), 

• positive linear combinations of cycles from q cover the full plane, and can be written as 
positive combinations of Ci{q), C2{q) and C'i{q) which are cycles from q. 

Thus, we can always use one of the at most three cycles Ci{q) for some q G supp(i?). There 
are still 0(A^^) combinations of them. O 


3. Window Theorem for Commutative Grammars 


This section is devoted to the following result, which we call the window theorem: in order 
to determine whether two commutative languages 'I'(Gi) and 'I'(G 2 ) over a fixed alphabet T, 
of size A defined by grammars Gi and G 2 with N non-terminals are disjoint or equal, it is 
enough to only examine a small window of size which is single exponential in N. This result 
will be instrumental in the proof that inclusion, universality, and disjointn ess p roblems are 


in 112 commutative grammars over an alphabet of hxed size (Theorem 5.5 below). 


The situation is much easier for A = 2 than in the general case. From Corollary |2. 15 
we know that each of T(Gi) and 'I'(G 2 ) is a union of 0{N‘^) simple bundles bounded by 
(^G,r^). In this case, we can show the following result: for each v G we can find a vq of 
single exponential magnitude such that vq is in exactly the same of our 0{N‘^) bundles as v. 
This is achieved in Lemma |3.5| in the following way: 
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• Let W + be one of the bundles. From Lemma 2A we know that each member of 
(detM)Z^, where M is the matrix whose columns are 2, is a member of 2^®^. Hence, if 
w G (det M)lP, then v + Z®^ iff n + re G kF + Z®'^. Since there is just a polynomial 
number of bundles, we can use the least common multiple of the determinants to ensure 
that the equivalence above is satisfied for each bundle. 

• The bundle is kF + 2^®^, not u G kF + Z®'^, thus we need to ensure that the signum of the 
coefficients remains unchanged. This is done by partitioning Mp into regions. Two points 
V, vq are in the same region if u G lu + Z®^ iff uq G + Z®^ for each w, Z satisfying the 
necessary bounds. We show that we can additionally ensure that the single exponentially 
bounded vq is in the same region as v, which proves the lemma. 

However, this approach no longer works for H > 2, as ik(Gj) no longer needs to be a union 
of polynomial number of simple bundles; section below is devoted to showing an example 
of such a grammar. 

We solve this problem by using the regions again. Although 'I'(Gi) need not be a union 
of a polynomial number of simple bundles, this is true when we consider regions separately: 
for each region r, 'k(Gi) Hr equals UHr, where C/ is a union of polynomial number of simple 
bundles. This is proven in Lemma |3.4| below. 

The rest of this section provides a detailed statement and proof of our window theorem. 
In fact, we will prove the following Lemma about semilinear sets, without directly using the 
assumption that our semiliner sets come from commutative grammars; the window theorem 


about commutative grammars (Theorem 3.2 below) will follow easily from it. 


Lemma 3.1. Let C G N. Let S'! and S2 be two semilinear sets over the same fixed alphabet 
S of size A, given as Sk = Uje4 + .2®^, where kFj C [— C'..C']^, Zi C [—for every 
f G Ii U 12 - Then there exists a number i fei = 0{{C + y)FiU/2|^ 5^ C S2 iff 

n C 52 n and 5i is disjoint with S 2 iff 5i Pi 

is disjoint with S 2 n [— 

Theorem 3.2. Let Gi and G2 be two commutative grammars in normal form with at most 
N non-terminals each, over the same fixed alphabet S of size A. Then there exists a number 
1^22] which is single exponential in N, such that 'I'(Gi) C ^(^ 2 ) iff 'I'(Gi)n [— C 
T(G2)n[-itei..ife2l]^, and T(Gi) is disjoint with 'I'(G2) iff ^(G*i)n[-ife2l.lfe2i1^ is disjoint 
with T(G2) n [-%2l-%2]|^- 

Proof of Theorem 3 ^. Let Gu = (H, Qu, Qu, Su)- Without loss of generality we can assume 
that the sets Qu are disjoint for u G {1, 2}; this way, we will be able to identify the grammar 
of each run and cycle by mentioning one of the non-terminals used. 

For q G Qu, let Zg = {'I'(C') : C is a simple cycle from g}, and for 5 C Qu, let Zs = 


From Theorem 2.13 we know that v G ^{Gu) iff there exists a run Ri from Qu such that 
|/2i| is bounded by Bq and a linearly independent set of simple cycle outputs P C ZsuppRj 
such that V G -|- P®^. 

In particular, we know that P is of cardinality at most A. Let be the set of all subsets 
of Qu of cardinality at most A. Thus, we know that v G 'I'(Gu) iff there exists J £ lu, & run 
i?i of Gu such that J C supp(i?i) and |i2i| is bounded by Bq, and a set P C Zj such that v G 
'k(Pi)-|-P®^. Let kFj = {'L(P) : i? is a run in Gu bounded by Bq such that J C supp(i?)}. 
Thus, we get that 'L(Gu) = j + 2^®^). We know that kFj is bounded by Bq which 

is single exponential, and Zj is bounded by T = L^, which is also single exponential. 
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Since the cardinality of lu is polynomial in N, we get our claim by applying Lemma 

o □ 


Proof of Lemma 3.1. We will need to introduce the notion of a region. 


Let Sy be the family of all linearly independent subsets of [—y.W]^ containing A — 1 
elements. For each element S of Sy and v G let 4>s{v) be the determinant of the matrix 
Ms^v whose A — 1 columns are the elements of S (ordered in an arbitrary way), and the last 
column is v. From the properties of the determinant, we know that 4>s{v) = 
where aa is the (possibly negated) determinant of the (^ — 1) x (^ — 1) submatrix of 
Ms,v which misses the column v and the row a. In particular, aa € [—C^..C(f,], where 
C(f) = Ha -1 {Y). Intuitively, for each S, </>g^(0) is the A — 1-dimensional subspace containing 
all elements of S. By calculating 4>siv) we can tell whether v is above or below this 
subspace. Let <I>y = {(fs ■ S G Sy}. The cardinality of ^y is bounded by {2Y + 1)^^, and 
its elements have coefficients bounded by C^j). In particular, if G ‘hy, then |(/)(f)| < C(j,\v\, 
and ||(?!)(u)|| < ACti,\\v\\. 

Now, let Lb = [—2B,2B^, and TIb,y be the set of all functions from <hy x Lb to 

{- 1 , 0 , 1 }. 

For r G 'I^B,y, we define strict regions Reg(r), weak regions reg(r), and ray regions r(r) 
as follows: 


Reg(r) = {x G : V(^ G <hy VZ G Lb sgn{4i{x — 1)) = r^^i] 

reg(r) = {x G : V(/) G <hy \/l € Lb sgn{4>{x — 1)) r^p^i > O} 

T(r) = {x G : V(^ G <hy VZ G Lb sgn(^(x)) r^^i > O} 

The following picture shows what is going on for A = 2 and Y = 2>. 



The gray square in the center is L^. From each point of the square, and for each 
tuple of — 1 vectors in [—Y..Y^, we shoot a hyperplane which is parallel to each of these 
vectors. For A = 2, this means that we shoot a line in each direction given by some vector in 












COMPLEXITY OF PROBLEMS OF COMMUTATIVE GRAMMARS 


15 


[—y..y]^. There are 32 directions, and for each of them we get a bundle of lines, by taking 
different points oi Lb to shoot from; such bundles of lines are marked gray on the picture 
above. Each strict region Reg(r) is given by its relation to each of these lines (above, below, 
or on one of these lines). There are four types of regions: 

• empty regions (the relations are inconsistent), 

• a single point (r says that the point is on two (non-parallel) lines at once), 

• a semiline (r says that the point is on a line and above some other line), 

• a wedge (r says that the point is below line li and above line I 2 , where h and I 2 are not 
parallel). 

There are 32 wedge-shaped regions, an infinite number of semiline-shaped regions (bundled 
into 32 packs of parallel semilines), and an infinite number of points. 

Strict regions Reg(r) are disjoint and partition the plane, while the weak regions reg(r) 
are their closures. Ray regions r(r) look similar to Reg(r), but we shoot lines only from 0, 
instead of each point from Lb- There are 32 wedge-shaped and 32 semiline-shaped regions, 
each of them start at 0 (which is included into each r(r)). If Reg(r) is a single point or the 
empty set, then r(r) = {0}. 

The situation is more complicated in three dimensions, and it is also harder to draw. 
We will show how r(r) looks for ^ = 3 and y = 2. From the definition of r(r) we know that 
for each a > 0 we have x G T{r) iff ax G T{r). Thus, it is sufficient to draw only a situation 
on a planar section of M^. Let A = {u G : u > 0, |u| = 1}. The set A is a triangle; this 
triangle is shown on the picture below. 



The 19 white dots on the picture denote elements of [0..y]^; for each such v, its 
projection is shown as a white dot on the triangle (in fact, we should also have white dots 

for elements of [—y.y]^ with negative coordinates, but this would obfuscate the picture too 
much — their projections on our planar section do not fit in the triangle). S is the set of 
pairs of linearly independent elements of [—y..y]^, or pairs of distinct white dots. For each 
S G S, we have the linear function (ps, which equals 0 on both elements of S. These are 
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represented by the black lines: for each two white dots, we have a black line going through 
these white dots. 

Now, let r G TZb,y- The set r(r) is the set of points which are on the side of the 
semispace (p declared by for each p and I (or on this semispace, in case if = 0). In 
our case, semispaces are the black lines. Therefore, r(r) n A could be either an empty set (if 
this is inconsistent - this happens for example when ^ region of the triangle 

bounded by the black lines, or one of the white dots, or one of the points where the black 
lines cross, or a segment of a black line between two points where it crosses the other lines. 
The set T(r) can be described by multiplying T(r) n A by each scalar a > 0. Thus, each 
black line becomes a black plane, each white dot or black line crossing becomes a semiline, 
each black line segment becomes an infinite triangle, and each bounded region becomes an 
infinite cone. 

Regions reg(r) and Reg(r) are harder to visualize — we have to replace each black plane 
by a pack of black planes by moving it by each vector in , and again split Mp by these 
new black planes. 

The following lemma describes the shape of the regions. 


Lemma 3.3. Let r G T^bx- Then: 

• T(r) can be written as T®®”, where C [— 

• reg(r) = Wr + T{r), where Wr C 

In particular, if reg(r) is bounded, then r(r) = {0}. Cj^is bounded polynomially by Y, 
and is bounded polynomially by B and Y. 


For example, let us consider the two-dimensional case from the picture on page 14 


Take r such that reg(r) is an infinite wedge with a vertex in some point w. Then r(r) is a 
congruent wedge with a vertex in point 0, bounded by semilines parallel to some two vectors 
vi,V 2 G [—y..y]^. Tr will be simply {^ 1 ,^ 2 }, and Wr will be {u:}. Our lemma says that w 
is bounded polynomially in B and Y ; since the picture is big enough to already show all 
the 32 wedge vertices, we can take to be (half of) the edge of the square shown on the 
picture. In the three-dimensional case the situation will be more difficult: elements ot Tr 
will be bounded by a polynomial in Y (instead of Y itself), Wr will no longer have to be 
a singleton, and the cardinality of set Tr might be relatively large - for example, on the 
picture on page 15 there are ray regions r(r) n A bounded by five black lines, so we will 
need a Tr with five elements to describe the respective T(r). 

Having introduced the regions, we can use the following lemma: 


Lemma 3.4. Let B,Y G N. Then there is a bounded polynomially in B and Y, such 
that the following holds: 

Let S = IF-FZ®^, where Z C [-y..y]^ and W C [-B..B]^, and r G 7 Zb,y- Then there 
is a linearly independent subset P 2 and Wi C l-Ckn-OsAi^ such that S n reg(r) = 
{Wi -h 21^^) n reg(r). 

This lemma says that, if we restrict ourselves to a single region, we can improve our 
presentation Su = {Wi .21®^) by making the sets Zi linearly independent, while still 
keeping the bound on Wi exponential. Thus, in each region, Su is a union of \Iu\ simple 
bundles bounded by {Ow^ Y). 

Now, we use the following lemma: 




COMPLEXITY OF PROBLEMS OF COMMUTATIVE GRAMMARS 


17 


Lemma 3.5. Let S be a fixed alphabet, dfoj], y G N, and / be a set of indices. Then there 
exists a number = 0((QjJi+ such that the following holds: 

Let Vi = Wj + be a simple bundle bounded by (CjrH Y), for each i £ I, and 
r £ '^qoi.y ■ Then for each v £ iP n Reg(r) there exists a vq £ [—n Reg(r) such 
that for each i £ I we have u G Vi iff uq G 14. 


By applying the lemma above to / = IiU I 2 and taking %I] = 7^ 
hypothesis. 


we obtain our 
□ 


Proof of Lemma 3^, Without loss of generality we can assume that Reg(r) > 0. Indeed, 
for each a £ A, The set <l>y includes the linear function (fa given by (fa{x) = Xa- If > 0 , 
then Xq > 0 for all x £ Reg(r); similarly if < 0. In the second case, we can take the 
mirror image, which is also a region. 


Take </> G <Vy. 

r(r) = n, 


We can write the definitions of reg(r) and r(r) as follows: reg(r) = 


where 




^1 = 


: sgn((/>(x - 1)) > O} , 

: sgn((/)(x)) > O} . 


n 

f] {x£ 

I^Lb 

The sets are half-spaces or closed hyperplanes; (or its boundary) goes through 0, 
and is parallel. 

Let a{v) = A = {u > 0 : a{v) = 1}. The set r(r) is described by r(r) n A: 

for X G A and a > 0, we have ax £ T{r) iff x G r(r) n A. The set A is a bounded 
A — 1-dimensional polytope (simplex), and r(r) n A is its intersection with a finite number 
of closed halfspaces and hyperplanes Thus, r(r) n A is also a bounded polytope. Each 

vertex of this polytope is the point where A — 1 hyperplanes of form (j){v) = 0 (where (/> G <k) 
cross A. The coefficients of (f are bounded polynomially by T, so each vertex can be written 
as = ti/a{ti), where a{ti) is bounded polynomially by some Cj^ (by Lemma 2.3). Let 
Tr = ■ ■ ■ ,tN}- Note that the number of vertices, N, is bounded polynomially. 

It is easy to check that the set reg(r) is upward closed, i.e., if x G reg(r), i £ [1..A], and 
a > 0, then x ati £ reg(r). 

Let Wr be the set of points in reg(r) such that for all a > 0 and i, Wr — ati ^ reg(r). 
We have that reg(r) = Wr + . Indeed, 3 follows from upward closedness, and for C, 

take xo G reg(r). For i from 1 to N, let Xj = Xj_i — aiL, where a* G P is the greatest 
number such that Xj-i — aiU is still in reg(r) - such an a must exist, since reg(r) is closed, 
reg(r) > 0, and Xj_i — ati has a negative coordinate for large enough values of a. We obtain 
a point xn, which must be in Wr- Indeed, suppose that x' = xjv — ati £ reg(r). In that 
case, Xi-i — {ai + a)ti = x' + ajtj £ reg(r) because x' G reg(r) and reg(r) is upward 

closed. This contradicts the assumption that the value of a* was maximum. 

Take x G Wr- Let in case if is a hyperplane, or the boundary hyperplane 

of if is a half-space. Let C = |</> : x G The set := is 

an intersection of hyperplanes, and thus it is a subspace. 

The set n Wr has to be bounded. Indeed, suppose that n Wr is not bounded. 
Take a sequence [vi] of elements of n Wr such that limj_,.oo <T(uj) = 00 . Let Wi = Vi/a{vi); 
we have Wi £ A. Since A is compact, Wi has a cluster point, w. For (f £ <ky and I £ Lb, we 
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have (p{w) = 4>{wi — l)/a{wi) + / a{wi). The first component is zero or has sign 1) 

(since Vi G reg(r)) and the second component tends to 0, thus is either 0, or it has sign 
r{4>, 1). Thus, w G 'r(r), and also w G (the subspace parallel to going through 

0). The set H Wr is a polytope which is unbounded in the direction of w, but this is 
impossible, since x and x + aw cannot be both in Wr (from the definition of Wr). Hence, a 
contradiction. 

Moreover, the set n Wr has to be bounded polynomially. Indeed, the vertices of 
Hx n Wr are points where some A hyperplanes of form cross. These hyperplanes have 


coefficients bounded polynomially, and from Lemma 2.3 we get that the coordinates of 
vertices of Hx n Wr are bounded polynomially. 

Thus, Wr = \Jx&Wr {Hx n Wr) is bounded polynomially. □ 

Proof of Lemma \3.4\ If Reg(r) is bounded, then it is bounded polynomially in B and Y 
(Lemma |3.3[ ), and therefore we are done. Thus, we can assume that Reg(r) is unbounded. 

Suppose that there is a t G r(r) such that t ^ Z®^. We will show that in this case 
Reg(r) and S = W + Z®^ are disjoint, and therefore we are done. 

Indeed, note that Z®^ is a d-dimensional cone (for some d), whose faces are hyperplanes 
going through 0 and parallel to sets of (d — 1) vectors in [—T, T]^. The ray region r(r) 
cannot be subdivided into two smaller regions by such a hyperplane, therefore there is one of 
these hyperplanes, say (f, such that 4>{t) < 0, 4>{T{r)) < 0, and 4>{^) ^ 0- Take x G Reg(r) 
such that X = w + Y1 where w G Since cj){t) < 0, for each I G Lb we have 

< 0, and thus, since x G Reg(r), we have (/>(x — 1) < 0. Since Lb = [—2R, 2B]^, there is 
a lo € Lb such that (j){lo) < (fiw). Thus, (^(x) < (t>{lo) < (fiw) < 4>{w) + Yloti4'{^i) = 
which is a contradiction. 

Thus, we can assume that T(r) C Z®^. Each point in Z®^ can be written as ^ aiZi, 
where ai is positive only for a linearly independent set of vectors in Z (otherwise, if a* are 
positive in a linearly depe nden t subset of 2^, we can modify the values so that one of them 
is 0 - just like in Lemma 2.6). Therefore, r(r) C where J- is the family of all 


linearly independent subsets of Z. Again, each F®^ is a d-dimensional cone, whose faces are 
hyperplanes going through 0 and parallel to sets of (d — 1) vectors in [—Y,Y]^. The ray 
region T(r) cannot be subdivided into two smaller regions by such a hyperplane, thus there 
is a E G J-" such that T(r) C F®^. Let Zq be this E, and let D be determinant of the matrix 
whose columns are Zq (we add unit vectors if there are less than A vectors in Zq). From 


Lemma 2.1 we know that D < Ha (T), which is single exponential. 
Now, let ITi = 5 n 


the sufficient value of C foii will be apparent from the 
sequel of the proof. By induction over |u|, we will prove that each v G S can be written as 
V = wi + .Z®^, where wi gWi. 


< CWn the hypothesis obviously holds. 


> we apply the following Lemma: 


For V G S such that |u| 

For V G S such that |u| 

Lemma 3.6. Let S be a fixed alphabet, and B,Y,D G N. Then there exists a constant 
polynomial in B, Y and D, such that for each r G TIb,y and each v G reg(r), if ||u|| > ifegi, 
then V = VQ + Dt, where vq G reg(r) and t G T{r) n [—(where Cfoiis from Lemma 
3.3). Moreover, vq G Reg(r) iff u G Reg(r). 


Take v G S. By applying Lemma 3.6 iteratively, we know that v = vq + D{ti tx), 

where L G r(r) n and ||unll < 
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On the other hand, we know that v = wq + wi 2 , where wo G W and wi 2 = 
where /3^ G N. From Lemma 2.6 we can assume that /3^ exceed H = Ha (Y) only in a 
linearly independent P C Z. Take a large enough Jo Z H, polynomial in B and Y (the 
required value will be apparent below). We can write u = iuq + +'R’ 2 , where wi G 

and W 2 = where each (3^ > Jq, and P is linearly independent. 

Suppose that, for some i, ti G P®^. From Lemma 


2.4 


we obtain that Dti = YIvgp 
where 7 ^ G Z. Moreover, from Lemma |2.3[ 7 ^, < Jq, if Jq is large enough. Therefore, 
W 2 = W 2 + Dti, where Take v' = v — Dtp, we have v' = W 0 + W 1 + W 2 G 

W + = S. From the induction hypothesis, v' G VFi + .2®^, and Dti ® -2^®^ from Lemma 

2.4 Since v = v' + Dti, we obtain that v G IFi + Z®^. 

Otherwise, {ti : i = 1,... ,K} is disjoint from Therefore, is disjoint from 

P®® (except 0). By the same arguments as in the beginning of this proof, there is a 
hyperplane 4> G 4>v such that 4>{ti) < 0 for each i, and 4>{P) > 0. 

What is the value of 4>{v)7 On one hand, we have 4>{v) = 4>{vo)+D </>(L) < ACsB;^— 

DK. On the other hand, we have (f>{v) = 4>{wo + wi+W 2 ) = (j){wo) + (j){wi) + (p{w 2 ) = 4>{wo) + 

> —BAC^ — Jo\Z\YAC(j). From applying the triangle inequality 
to u = vo + D^j^ti, we get that |i;| < \vo\+DKAC^^ and hence, K > {(^- / DAC^ 

Thus, ACsH ifeel— D ( Oril— / DAQ^ > —BAC^f) — Jq\Z\YAC^ j). This is a contradiction 

for a large enough (but still polynomial) ffej]. □ 


Proof of Lemma 3.5, Let Di be the determinant of Zi (if Zi includes less than A vectors, 
add unit vectors as usual). Let D* be the least common multiple of all Di. We know that 
Di is bounded polynomially in Y, so D* is bounded polynomially by YI^L 

By applying Lemma [3^ (with D = D*) we get constants C fe^i and We know that 
each V G Reg(r) such that |u| > can be written as vq + D*t, where t G T(r). We repeat 
this construction (adding all the f’s together), until we get |uo| < 

Now, we have to show that u G V) iff uq G V). 

Suppose that u G V). Thus, we have v = wq + ^kPk-, where > 0, and (P^) are 
the members of Zk. We can write similarly uq = tco + Ylk hPk- Note that v and vq are in 
the same (ffen, Y)-region r. In particular, v — wq and vq — wq are on the same side of each 
hyperplane going through a member of Lq^ and parallel to ^ — 1 members of [—Y..Y^; 
therefore, since ak > 0, we also have fik > 0. Moreover, since v — vq = D*t, by Lemma |2.4[ 
V — Vo € Zf^. Therefore, ak — 13k has to be an integer, and therefore f3k G N. Thus, uq G V). 
The proof in the other direction goes in the same way. D 


Proof of Lemma\3.6[ Take v G reg(r). From Lemma 3.3 we know that v = w + 'YfotiL, 


where {U ■. i £ {I,..., A^}} = Tj C , and w G [-% 3 l-% 3 ]| • By taking a large 


enough B^^ we can ensure that if |x| > Bj^ then some Oj is greater than D. We have that 
Vq = V — Dti is a proper convex combination of v and vi = v — aJi, which are both in reg(r), 
thus Vq is also in reg(r). And if u G Reg(r), then so is vo {4>{vo — 1) from the dehnition of 
reg(r) is a proper convex combination of 4>{v — 1) and </>(ui — 1), <f{v — 1) has the correct sign, 
and <^(ui — 1) either has the correct sign or is 0 ). D 


4. A HARD GRAMMAR 

As mentioned in the introduction to Section]^ for A > 2 il'(G) is not a union of polynomial 
number of simple bundles, which makes it impossible to use the simple reasoning mentioned. 
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In this section, we will show an example of a grammar over an alphabet with three symbols 
where this is the case. We believe this example is interesting in its own right. 

The proof has two parts. In the first part, we create a sequence of grammars {Gn) 
over four terminal symbols (x, y, and two temporary ones), such that, if we ignore the 
two temporary terminal symbols, the convex hull of T(Gn) is a bounded polygon with 2'^ 
vertices. It is then straightforward to create a grammar over three terminal symbols (x, y, 
z) which has the required property. 

Theorem 4.1. There is a positive grammar Gn of linear size with terminal symbols x, y, 
5n+i and An, and non-terminal symbols Sq and Xn (where Sq is initial), such that the 
vertices of the convex hull of 'I'(Gn) are exactly the points for 

f = 0,..., — 1, and Xn generates only x^, where N = 2”. 

Proof. The proof is by induction. For n = 0 we simply put Sq —)■ 5iylo, Xq —)■ x. 

Take the grammar Gn] we will construct the grammar Gn+i- The symbol Xn+i is 
obtained by the following rule: Xn+i —)■ X^. 

We replace the terminal An with a non-terminal with two transition rules: 

• (1) An —)■ ^^+1 

• (2) An —)• S‘^j^2^riy 

The new vertices of the convex hull will be obtained by choosing one of the two rules for 
An, and applying it consistently. Thus, from each vertex of the 

convex hull of 'I'(Gn), we get the following vertices: 

• ( 1 ) 

. ( 2 ) y‘iN-l-i^i{i+l)/2+N(2N-2i-l)s2WgAJ^-Ai-2 

Now, we replace the terminal with a non-terminal with a single transition rule 
5n+i —)• An+iSn+ 2 - Hence, we get the following vertices: 

• (1) y*X*(*+l)/252*+1^4iV-2*-l 

By taking j = 2N — 1 — f, we can rewrite the second row of vertices as follows: 

. (1) y*x*(*+i)/252!+i4^-2-i 

• (2) y^x^(^+i)/252j^+i^4iv-2j-i 

Note that i runs from 0 to — 1, and j runs from N to 2N — 1. We can thus combine 
our two rows of vertices into one, indexed by i running from 0 to 2N — 1, and thus obtain 
the induction thesis. D 

Now, let G'n be obtained from by removing all the terminals An+i and Sn+ 2 - The 
convex hull of T(G(^) is the polygon P with vertices for i G [O..A^ — 1]. 

Now, we create a new grammar G° over x, y, z whose Parikh image will be geometrically 
the infinite cone with base P. This is done simply by adding a new initial symbol S° to G^, 

Z 0 

with two transitions S° —)• SqS° and S° —)■ 0. It can be easily seen that periods are single 
exponential, and since 4'(G°) is an infinite cone with 2” edges, we cannot write 'k(G°) as 
a union of a polynomial number of simple bundles - indeed, each simple bundle can cover 
only at most 3 edges. 
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5. Complexity results 


In this section, we provide the tight complexity bounds for the pproblems we are considering. 
Some of the results have been previously known (e.g., |Huy84 , Esp97| ), we include all the 
proofs for completeness. 


Theorem 5.1. The following membership problem in commutative regular grammars is in 
P for a fixed S: 

Given: a regular grammar G, v £ 

Decide whether v G ^(G) 

To prove this Theorem, we will need the following two lemmas: 


Lemma 5.2. For P C Q, let 

Tln{P, q) = {'I'(-R) : 12 is a run from q, \R\ < n, and P C supp(12)} . 

For a regular grammar G over S of size A with N nonterminals, the sets 77.„(P, q) for all 
P of cardinality at most k and all n < P can be calculated in time 0{{2B)^\G\N^~^^) and 
space 0{{2B)^N^~^^). 

Proof. Note that a run of length n from q consists of a transition from q to [q'] and a run 
from q' of length n — 1. Thus, we get the following recursive formula: 


Tli{P,q) = {'l'(fi) : d G 5, source(d) = g,target(d) = 0, {( 7 } C P} 

Tln{P,q) = 'JZn-i{P,q)'J'Pn{P - {q} ,q)'J 

{'l'(d) +TZn-iiP,r) : d G d, target(d) = [r], source(d) = g} 

We know that Tln{P, q) ^ [—n..n]^. This allows us to calculate all the sets TZniP-, q) using 
dynamic programming. O 

Lemma 5.3. Let 

'Pniqi, 92 ) = {'f'( 12 ) : P is a path from qi to q 2 and |P| < n} . 

For a regular grammar G over S of size A with N nonterminals, the sets Pn{qi, 92 ) for all 
n < B can be calculated in time 0{{2B)^\G\N‘^) and space 0{{2B)^N^~^‘^). 

Proof. The idea is essentially the same as in Lemma |5.2[ We use the following recursive 
formula: 


'Po{qi,q2) = 0 if gi / ^2 

'Poiq^q) = { 0 } 

'Pn{qi,q2) = Pn-l(P, Q')U 

{'l'(d) +Vn-i{r,q 2 ) ■ d G d,target(d) = [r], source(d) = qi} 

□ 
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Proof of Theorem 5.1 . The algorithm is as follows. 
(1) Let Be be the bound on Ri from Theorem 2.13 
Using Lemma 


( 2 ) 

(3) 

(4) 


Using Lemma 


5.2 


5.3 


P\ 


we find the sets qi) for all P such that 

we find the sets VNiQ, q) for all P such that |P| < 
For each P such that I PI < A: 


For each linearly independent subset Z of Uq^pV^iQ, q)- 
For each element w of Pbq(P, qj): 

V — w G 


< A. 

A. 


(5) Otherwise return NO. 


return YES. 


Since Z is linearly independent, we can check whether v — w G using Gaussian 

elimination. 

It is straightforward from Theorem |2.13| that this algorithm will return YES if u G T(G). 
It is also straightforward that v G 'I'(G) if the algorithm answers YES. 

This algorithm runs in time O . It uses space 0((2P)"^A^^®^). 

□ 


Theorem 5.4. The membership problem in commutative grammars is in NP. 
This result was previously known |Esp97|. 


Proof. From Theorem 2.13 we know that if u G 'k(G), then v = T(Pi) + ^ 'I'(Gfc)nfc, where 
Pi and Cfc are bounded exponentially, and Ck are linearly independent. We can simply 
guess Pi and Ck- □ 


Theorem 5.5. The inclusion, universality, and disjointness problems in commutative 
grammars over an alphabet of fixed size are in II®. 

Proof. We will consider inclusion; the other problems can be solved in the same way. 

Let Gi and G 2 be two commutative grammars in normal form with at most N non¬ 
terminals each, over the same fixed alphabet S. 

By Theorem |3.2t there exists a number which is single exponential in N, such that 
'I'(Gi) C 'I'(G 2 ) iff 'I'(Gi) n C T(G 2 )n Thus, we need to check 

the membership in ^(^ 2 ) for all the elements of the set [—n T(Gi). This can 
be done in II^. □ 

Theorem 5.6. The inclusion and universality problems in regular grammars over an 
alphabet of fixed size are in coNP. 

Proof. The proof is just like for Theorem |5.5[ Again, we need to check the membership 
in ^(^ 2 ) for all the elements v of [— Poi-. Poi] ^ H T(Gi). However, this time checking 
membership of v can be done in P, so the whole algorithm works in coNP. O 

Theorem 5.7. The problem of inclusion of commutative grammars over S = {a} is H^hard. 

This result was previously known |H^ . 

Proof. We will reduce the following problem (3-CNF-QSAT2). Let the clauses Gj (0 < 
j < m) be disjunctions of at most three literals of form Xi, -iXj, -<yi = or -<yi. Does the 
formula 

VxoVxi... ... 3yi_i Ag 

j<m 
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hold? Quantifiers run over two possible values of each variable (either Xj or -iXj is true). 

The symbol Ai {i < k) generates a^\ for i < k. This can be realized with the following 
rules: a, Ai ^ for i > 0. 

The symbol Aj (i < k) generates or nothing. This can be realized with the following 
rules: Ai —>■ 0, —)• Ai. 

The symbol Cj (j < m) generates {a? . This can be realized with the following rules: 

Cj ^ Ajf_iAf^_i, Cj y Cj—\Cj—\Cj—\Cj—\ for j > 0. 

The symbol Cj (j < m) generates (a^ or nothing. This can be realized with the 
following rules: Cj —)> 0, Cj Cj. 

The symbol Xi (i < k) has the following two rules: Xi —)■ Ai Ylj<cm(^j ■ ^ 

C,)\nj<miCj:^x,eC,). 

The symbol Yi {i < 1) has the following rules: T* I\j<mi^j ■ Vi ^ Cj) \ Hj^^iCj : 
-^Ui G Cj). 

The symbol 5i has the following rules: 5i — Y\i<ik Y\j<m 

The symbol S 2 has the following rules: S 2 —)■ ni<A: ni<z 

We ask whether the language generated by Si is a subset of the language generated by 
52. 

Note that each Ai is generated at most once, and each Cj is generated at most three 
times. The definitions of these symbols ensure that we can treat them as independent: for 
each combinations of Ai and Cj in S'!, we need to find the same combination in 82 . 

The language generated by Si has 2^ elements (for each Ai, we can either add it or not). 
These correspond to possible valuations of variables of Xi. To match the given element of Si 
in S 2 , we need to make the same choices in Xi. We also need to generate C'oC'iC '2 ... Cj-i- 
This means that we have to make choices in Yi which generate all the clauses which were 
not covered by our choices in Xi. Therefore, Si C S 2 iff the formula is true. D 


Theorem 5.8. The problem of universality (is 'k(G') = Z^?) of commutative grammars 
over S = {a} is hard. 


Proof. We use the same reduction as in Theorem 5.7 It is enough to have a symbol S^ which 
generates the complement of Si. Indeed, add a new symbol ^ 4 , with rules ^4 —>• 52 | 53 . The 
universality of ^4 is equivalent to 5i (the complement of S 3 ) being included in S 2 , which is 
equivalent to our 3 -CNF-QSAT 2 formula being true. 

This can be done as follows: 

The symbol C^ generates 0, 2, or 3 copies of Cj. This can be done with the following 
rules: Cf ^ 0 \CjCj\CjCjCj. 

The symbol Z~^ generates any number of a’s which is at least 2^4”^. This can be realized 
as follows: —)• 

The symbol Z~ generates any negative number of a’s. This can be realized as follows: 
Z- ^ a-^Z-|o-^ 

Now, we can define S3 as S3 


z^\z^\n^<kAin 


CH 
lj<m '-'j ■ 


□ 


Proposition 5.9. The problem of membership in commutative grammars over S = {a} is 
NP hard. 


Proof. We reduce the 3-CNF-SAT problem, 
except that we take A: = 0. 


The proof is the same as in Theorem 5.7 


□ 
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Proposition 5.10. Let G be a commutative regular grammar over S = {a}. Then the 
problem of deciding universality is coNP-hard. 

Proof. We reduce the 3CNF-SAT problem. Let cj) = Ai<i<A:^* ^ 3CNF-formula with 

n variables xi . .. (which can be 0 or 1) and k clauses. Let pi,P2, ■ ■ ■ ,Pn he n distinct 
prime numbers. Let i G [1..A:]. Suppose that clause Ci is of form Vfce[i 3 ] = W*.- Our 

grammar will have states Sj, where 0 < j < Mi = PaxPa 2 PaP-, we have cyclic transitions 
S] A 0 each j not satisfying Vfee[i.. 3 ]i modpa^ = Va^. We also 

have transitions sq A Sq for each i. 

From simple number theoretic arguments we get that x ^ 0(0) iff the formula (p is 
satisfied for Xi = x mod pi. □ 


Proposition 5.11. For a commutative regular grammar G over S (whose size is not fixed), 
and K G N^, the problem of deciding whether K G 0(0) is NP-hard. 

Proof. We show a reduction from the Hamiltonian circuit problem. Let (H, E) be a graph. 
We take E = Q = V, and for each edge (ui, V 2 ) we add a transition A U 2 - We pick an 
initial state sq and add a final transition sq A 0 . The graph (H, E) has a Hamiltonian circuit 
iff iL= (1,1,...) G T(G). □ 


Proposition 5.12. The disjointness problem in commutative regular grammars (over an 
alphabet of unhxed size) is coNP-complete. 


Proof. From Proposition |5.1l| we know that membership is NP-hard. We can easily construct 
a grammar G 2 such that 'I'(G 2 ) = {K} (where K is from the proof of Proposition 5.11), 
and ask for non-disjointness of G and G 2 . 

From Theorem |5.4| we also know that membership is in NP (even for non-regular 
grammars). We reduce non-disjointness of Gi and G 2 to membership in the following way. 
It is sufficient to check whether 0 G 'I'(Gi) — 'I'(G 2 ), where 'I'(Gi) — 'I'(G 2 ) = {ui — U 2 : ui G 
'I'(Gi), U 2 £ 'I'(G 2 )}. We create a grammar Gi — G 2 such that 'I'(Gi — G 2 ) = 'k(Gi) — 'I'(G 2 ): 
first, create the grammar —G 2 such that 4'(—G 2 ) = {—v ; v G ^'(^ 2 ) by replacing each 
production by its negative, and then create the grammar Gi — G 2 by replacing each final 
transition in Gi (i.e., <5 such that target((5) = 0 ) by transitions going to each initial state of 
G2. □ 


The following table summarizes the complexities of various problems regarding com¬ 
mutative grammars. Alphabet size F means fixed, and U means unhxed. We include the 
very recent result regarding unhxed alphabets |HH14j that inclusion for regular grammars 
over an unhxed alphabet is coNEXP hard; it is likely that the proof can be also adapted for 
universality. On the other hand, it is known from |Huy85| that inclusion and universality 
for context-free grammars over an unhxed alphabet is in coNEXP. For completeness, we 
also include N (the non-commutative case - note that the membership problem is actually a 
different problem in the non-commutative case, since we cannot encode long words succinctly 
with the length of the word in binary); these results are known from other sources l [MS72j . 
also see |HU79j for a reference). The letter c means complete. Note that inclusion and 
equivalence problems easily reduce to each other. 
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regular languages 

alphabet size 

1 

2 

F 

U 

N 

membership 

P 

P 

P 

NPc 

P 

universality 

coNPc 

coNPc 

coNPc 

? 

PSPACEc 

inclusion 

coNPc 

coNPc 

coNPc 

coNEXPc 

PSPACEc 

disjointness 

P 

P 

P 

coNPc 

P 

context-free languages 

alphabet size 

1 

2 

F 

U 

N 

membership 

NPc 

NPc 

NPc 

NPc 

P 

universality 

nfc 

RPc 

RPc 

? 

undecidable 

inclusion 

nfc 

RPc 

RPc 

coNEXPc 

undecidable 

disjointness 

coNPc 

coNPc 

coNPc 

? 

undecidable 


6 . Conclusion 

The table above contains question marks for languages of unfixed size. It is known that 
these problems are H^-hard for context-free grammars, and they are in NEXP. Our method 
heavily uses the fact that the size of the alphabet is hxed, so we probably cannot easily 
generalize it. As mentioned above, coNEXP hardness of inclusion for regular grammars has 
been shown recently |HH14j . 

A natural question extending this research is counting. We can answer the question 
whether v G 'I'(G'), but what about the number of paths (or words) which lead to the given 
v7 This number is exponential in |u|, and thus it could be very large, so we cannot always 
hope for the exact answer—but we could count up to some threshold T, modulo M, or count 
approximately. We can answer the question whether ^(Gi) C T(G 2 ), but is the number of 
paths smaller in the first case than in the second case? 

Thanks to everyone on AUTOBOZ 2009 for the great atmosphere of research, especially 
to Slawek Lasota for introducing me to these problems. Also I would like to thank Anthony 
Widjaja Lin for our collaboration on the merged paper |KT10j . and the anonymous referees 
for their insightful comments. This work is supported by the Polish National Science Centre 
Grant DEC - 2012/07/D/ST6/02435. 
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